Abstract

Alternative splicing is a major contributor to cellular diversity. Therefore, the identification and quantification of
differentially spliced transcripts in genome-wide transcript analysis is an important consideration. Here, I review the 
software available for analysis of RNA-Seq data for differential splicing and discuss intrinsic challenges for differential 
splicing analyses. Three approaches to differential splicing analysis are described, along with their associated software 
implementations, their strengths, limitations, and caveats. Suggestions for future work include more extensive 
experimental validation to assess accuracy of the software predictions and consensus formats for outputs that 
would facilitate visualizations, data exchange, and downstream analyses.

What is differential splicing analysis and why is it
useful? 

With the explosion of transcriptome data derived 
from RNA-Seq, it has become apparent that alternative 
splicing is a major contributor to cellular diversity in both 
normal tissues and disease [1-5]. Sets of genes that are 
regulated by alternative splicing are often different from 
those that are regulated by differential expression [6] 
and tend to highlight different biological processes [7,8]. 
Therefore, differential splicing complements differential 
gene expression in genomic-level descriptions of gene 
regulation in biological systems. 

Alternative splicing (AS) is prevalent in multicellular 
organisms, affecting approximately 50%-60% of genes in 
Arabidopsis [9] and approximately 90%-95% of genes in 
mammals [10,11]. It includes exon skipping, intron 
inclusion, mutually exclusive exons, and alternative 
5' or 3' splice sites for an included exon. Along with 
alternative promoter usage and alternative polyadenylation 
site usage, AS allows multiple mRNA variants, or 
isoforms, to be produced by a single gene [10-18]. AS 
isoforms generate regulatory and functional diversity, 
differing in untranslated regions that regulate transcript 
localization, stability, or translation, or in regions encoding
protein-protein interactions or sites for post-translational
modification [19-21]. AS analysis describes the alternative 
use of exons and splice sites within a single gene or 
sample. AS databases, which are listed at [22], and 
software for AS analysis, which is reviewed elsewhere 
(e.g., [23]), are beyond the scope of this discussion, 
which is focused on differential splicing. 

Differential splicing analysis describes the differences 
in AS site usage between two samples. This is critical for 
studies involving mechanisms of AS and its regulation. 
In addition, differential splicing analysis describes the 
differences in splice isoforms between two samples, 
uncovering functional diversity that is missed by differential 
gene expression analysis. Available software tools and 
packages take conceptually different approaches that 
identify differential splicing at the level of the gene, 
the exon, or both. The choice of which approach and 
software is the best to use for a given analysis depends on 
the experimental objectives and expected outcomes. Table 1 
summarizes the software under review, highlighting the 
strengths of each, their limitations, and directions for future 
improvements. 

Table 1 Summary of differential splicing analysis software

© 2014 Hooper; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative
Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and
reproduction in any medium, provided the original work is properly cited.

Hooper Human Genomics 2014, 8:3
http://www.humgenomics.com/content/8/1/3

The challenges for differential alternative splicing
analysis

Splicing analysis requires more input data than gene
expression analysis because a given gene often has several
splice isoforms. A sequencing read can map anywhere
along a gene and count towards expression, but a read
must include the AS region to count towards splicing 
analysis. In a recent human study, 100- to 150-M reads 
(100 nt, paired end) were needed to detect 80% of AS 
events, and >400-M reads were needed to detect 80% of 
AS differences between two conditions [40]. Given 
current sequencing costs and computational approaches, 
statistically robust detection of differential splicing is 
biased towards the more abundant transcripts. 
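
The reason splicing analysis needs so many more reads can be illustrated with a back-of-the-envelope calculation. The Python sketch below is only a simplified model, not a method used by any of the reviewed packages: it assumes that reads start uniformly along a transcript, that the junction is not near a transcript end, and that a read counts as junction-spanning only if it overlaps the junction by a minimum number of bases on each side. The function name and the default overhang of 8 nt are assumptions chosen for illustration.

```python
def junction_read_fraction(read_len, transcript_len, min_overhang=8):
    """Fraction of uniformly placed reads that span one given junction
    with at least min_overhang bases on each side (simplified model)."""
    # Number of read start positions that cover the junction adequately.
    spanning_starts = read_len - 2 * min_overhang + 1
    # Number of read start positions anywhere on the transcript.
    all_starts = transcript_len - read_len + 1
    if spanning_starts <= 0 or all_starts <= 0:
        return 0.0
    return min(1.0, spanning_starts / all_starts)


# Every read on the transcript informs expression, but under this model
# only a small share of them informs usage of one particular junction.
frac = junction_read_fraction(read_len=100, transcript_len=2000)
informative_reads = 1000 * frac  # expected junction reads out of 1000
```

Under these assumptions, fewer than one read in twenty on a 2-kb transcript is informative for a given junction, which is consistent with the much greater depth required for splicing than for expression analysis.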

The problem is compounded because many genes have 
multiple AS choices, and each resulting isoform may have 
biologically unique properties. To reconstruct which 
full-length isoforms are present in what amounts requires 
either full-length sequencing of transcripts or full 
knowledge of all possible isoforms as well as considerable 
modeling based on the shorter reads. In the future, 
when high-throughput full-length transcript sequencing 
becomes cost-effective, splicing analysis will undoubtedly 
be based on comparison of isoforms. However, most 
current approaches ignore isoform reconstruction in favor 
of local analysis. 

The challenges associated with differential splicing 
analysis and the following discussion of differential splicing 
analysis software apply equally to mammalian systems and 
to organisms with more compact genomes. The primary 
difference is that better performance is expected for 
all approaches in compact genomes, due to their lower 
complexity [41]. 

Three approaches to differential splicing 
analysis — what does each do? 

If the experimental objective is to identify the relative 
abundance of the encoded alternative protein products, 
along with linked regulatory information (alternative 
promoters, alternative 3' untranslated regions (UTRs)), 
then isoform modeling with either Cuffdiff 2 [25] or MISO 
[27] is most appropriate. Here, all reads that have been 
mapped onto a gene are distributed between its isoforms, 
producing a probabilistic model of the frequency of each 
isoform in the original sample. While both MISO and
Cuffdiff 2 rely on annotated files of splice isoforms, Cuffdiff 2
can also incorporate novel splicing events that it discovers
in the mapped sequencing files. Cuffdiff 2 returns a list of
differentially spliced genes, with associated p values and
false discovery corrections, but with no indication of the
exons/junctions involved. Many genes are filtered out due
to low coverage and/or wide confidence limits of isoform
predictions. MISO uses different statistics and less filtering
to return a list of differentially spliced genes. It includes
detailed information on the detected splicing differences
such as which exons/junctions are involved, alternative splice
type (skipped exon, mutually exclusive exons, retained 
intron, alternative 5' splice site, and alternative 3' splice 
site), magnitude of difference, and coverage. One drawback 
of MISO is that it lacks statistical methods for handling 
groups of samples. Thus, MISO gives richer information 
in two-way comparisons, while Cuffdiff 2 incorporates 
the added statistical power of multiple samples and/or 
biological replicates. 

For both MISO and Cuffdiff 2, genes with many 
isoforms present a statistical problem. As the number 
of isoforms increases, so do the degrees of freedom 
in the model, such that the confidence intervals for 
each isoform also increase. Based on this theoretical 
consideration, isoform reconstruction methods are expected 
to be biased against detection of differential splicing 
in genes with many isoforms. 

If the experimental objective is more local, dealing 
with inclusion of specific exons or splice sites, then 
several choices are available. DEXSeq [28,29] and DSGseq
[30,31] focus on exon usage; they equate differential usage
of non-terminal exons with differential splicing. Read
densities are calculated for every annotated exon in
the genome, data dispersion is modeled, and junction
information is ignored. Neither will do a simple two-sample
comparison; they both require groups of samples and
look for differences between groups. They calculate the
probability that usage of each exon is different by comparing
read densities across the gene between sample groups.
DEXSeq reports p values for differential usage of each
exon, with corrections for false discovery and with associated
fold change. DSGseq reports differential exon usage,
expressed by a novel NB statistic, for each exon and for
each gene, and includes the mean read density across
the gene (coverage). DSGseq reports no quantification
(e.g., fold change) of the differential exon usage.
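
A minimal Python sketch may help make the exon-centric idea concrete. It is not the DEXSeq or DSGseq algorithm (both model count dispersion across replicates and report statistical significance); it only shows how exon read counts can be normalized across a gene so that a change in exon usage is separated from a change in overall expression. The function names, the pseudocount, and the toy counts are assumptions made for illustration.

```python
import math


def exon_usage(counts):
    """Return each exon's share of the reads mapped to the gene."""
    total = sum(counts)
    return [c / total for c in counts]


def usage_log2_fold_change(group_a, group_b, pseudo=1e-3):
    """Per-exon log2 ratio of mean usage in group B over group A.

    group_a, group_b: lists of samples; each sample is a list of
    read counts, one per annotated exon of a single gene.
    """
    def mean_usage(group):
        usages = [exon_usage(sample) for sample in group]
        n_exons = len(usages[0])
        return [sum(u[i] for u in usages) / len(usages) for i in range(n_exons)]

    mean_a, mean_b = mean_usage(group_a), mean_usage(group_b)
    # The pseudocount avoids division by zero for unused exons.
    return [math.log2((b + pseudo) / (a + pseudo)) for a, b in zip(mean_a, mean_b)]


# Toy gene with three exons. Expression roughly doubles in group B,
# but the middle exon receives a much smaller share of the reads.
group_a = [[100, 100, 100], [90, 110, 100]]
group_b = [[250, 100, 250], [230, 90, 260]]
lfc = usage_log2_fold_change(group_a, group_b)
```

Note that in this toy example the usage of the two outer exons rises simply because the middle exon's share falls, which is a small-scale version of the normalization artifacts that can affect exon-centric methods.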

The exon-centric approach of DEXSeq and DSGseq is 
immune to isoform complexity. It is expected to be 
biased against small exons; this includes alternate 5' or 
3' splice sites, which are defined as separate exons. 
Small exons will have lower read counts than large 
exons, and thus more variability in the relative read 
densities that are used to calculate differential usage. In 
addition, the normalization across a gene that corrects 
for differences in gene expression levels between samples 
can introduce artifacts. For instance, when a gene has
alternative sites for transcription initiation or polyadenylation
that generate isoforms with substantially different
lengths, and those isoforms are also differentially expressed,
these events can be misidentified as differential splicing.

The performance of DEXSeq and DSGseq is heavily 
dependent on the annotation files, as they only consider 
annotated exons. RefSeq annotations [42] are conservative 
and exclude many exons and alternative splices that 
are likely to be of interest. ENSEMBL and GENCODE 
(the human ENSEMBL genebuild) annotations [43,44] 
not only include many more exons and alternative splices, 
but also include gene fusions (read-through from one 
gene to the next) that confound normalization for gene 
expression levels. Thus, analyses based on RefSeq annotations
will miss many 'interesting' events, while those based
on ENSEMBL will lose statistical significance through
extensive multiple testing corrections. Therefore, the annotation
used should be selected depending on experimental
objectives, and it might be prudent to run parallel analyses 
with each annotation file. 
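
The cost of a larger annotation can be shown with a small worked example. The Python sketch below assumes that the false discovery correction is the Benjamini-Hochberg procedure, which is a common choice but is an assumption here rather than a statement about any particular package; the function name and the p values are invented for illustration.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# The same three strong signals, tested among 10 or among 100 exons.
signal = [0.0004, 0.001, 0.002]
small = benjamini_hochberg(signal + [0.5] * 7)
large = benjamini_hochberg(signal + [0.5] * 97)
```

With 10 tests, all three signals keep adjusted p values below 0.05; with 100 tests, the weakest of the three rises above 0.05, even though the underlying data are unchanged.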

SplicingCompass [36,37] shows great promise as a refinement
of DEXSeq and DSGseq. It includes junction
information that improves performance on small exons 
and can accommodate novel (un-annotated) splicing. It 
detects differential splicing on the gene level instead of 
considering exons individually. This better accounts for 
combined effects and reduces the number of statistical 
tests, thereby reducing the need for multiple testing 
corrections. On the other hand, it returns information 
only at the gene level and provides no insight into either the 
isoforms (with encoded protein products) or which exons 
or splices are involved. Future releases incorporating 
these features are very welcome. 

MATS [33] takes the most local view, calculating exon 
inclusion (percent spliced in (PSI)) from junction and 
exon reads for the exon and its two flanking exons. It 
then compares values between samples or conditions 
(groups of samples) to return the probability of differential
splicing, expressed as PSI. It recognizes the type of alternative
splicing (e.g., skipped exon and retained intron) and
returns both p value and magnitude (ΔPSI) for each
alternative splice in the dataset. It can perform either 
two-sample comparisons or look for differences between 
groups of samples. This approach has high precision (low 
false-positive rate), at the expense of some sensitivity [33]. 
It should be unaffected by isoform complexity, especially 
alternative promoter or 3'-end usage, and should perform
equally well on both annotated and novel splice events. 
However, the use of flanking exons means MATS is 
blind to events involving the first or last exon, which 
contains the untranslated regions (UTRs) where most 
post-transcriptional regulatory sequences are found. 
The precise description of the type of alternative splicing 
is especially valuable where the experimental question
involves splicing mechanisms, while the under-representation
of UTRs is problematic if the experimental
question involves post-transcriptional regulation. 
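
The PSI calculation for a skipped exon can be sketched in a few lines of Python. This is a simplified illustration, not the MATS implementation: it uses junction reads only, and it scales counts by the number of junctions each form produces (two for inclusion, one for skipping) as a stand-in for the effective lengths that a full model would use. The function names and default values are assumptions.

```python
def psi(inclusion_reads, skipping_reads, inclusion_len=2, skipping_len=1):
    """Percent spliced in from junction read counts.

    Inclusion of a cassette exon creates two junctions and skipping
    creates one, so counts are scaled by the number of junctions
    (a simplifying assumption standing in for effective lengths).
    """
    inc = inclusion_reads / inclusion_len
    skp = skipping_reads / skipping_len
    if inc + skp == 0:
        return None  # no informative reads for this event
    return inc / (inc + skp)


def delta_psi(sample_1, sample_2):
    """Difference in PSI between two (inclusion, skipping) count pairs."""
    psi_1, psi_2 = psi(*sample_1), psi(*sample_2)
    if psi_1 is None or psi_2 is None:
        return None
    return psi_1 - psi_2


# 40 inclusion and 20 skipping junction reads versus 10 and 45.
difference = delta_psi((40, 20), (10, 45))
```

Because the calculation uses only the alternative exon and its two flanking exons, it is local by construction, which is why events at the first or last exon cannot be scored this way.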

DiffSplice [35] takes a related approach based on 
alternative splicing modules (ASMs), regions where 
transcript isoforms diverge. The exon and splice junction 
data for each gene are represented as a splicing graph 
(splicing map) that is built from sample data without 
relying on annotated transcriptomes or pre-determined 
splice patterns. Alternative splicing generates branches in 
the splicing graph, which are usually local and modular 
(ASMs). Thus, a given gene may have several ASMs, each 
involving one or more alternative exons and splice 
junctions. DiffSplice then utilizes a non-parametric permutation
test to identify significant differences in expression at
both gene level and ASM level. Thus, it performs 
equally well on annotated and novel events, as well as 
splices involving transcript ends, and is unaffected by 
isoform complexity. Its precision and sensitivity await 
rigorous evaluation. 
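
The logic of a non-parametric permutation test can be sketched as follows. This Python example is a generic two-group permutation test on the difference of means; it is not the DiffSplice test statistic, and the function name, the number of permutations, and the example values (which could stand for the relative abundance of one path through an ASM in each sample) are assumptions for illustration.

```python
import random


def permutation_pvalue(group_a, group_b, n_perm=2000, seed=0):
    """Two-sided permutation test on the difference of group means."""
    rng = random.Random(seed)
    observed = abs(sum(group_a) / len(group_a) - sum(group_b) / len(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    hits = 0
    for _ in range(n_perm):
        # Reassign group labels at random and recompute the difference.
        rng.shuffle(pooled)
        diff = abs(sum(pooled[:n_a]) / n_a
                   - sum(pooled[n_a:]) / (len(pooled) - n_a))
        if diff >= observed - 1e-12:
            hits += 1
    # Add one to numerator and denominator so the p value is never zero.
    return (hits + 1) / (n_perm + 1)


p_different = permutation_pvalue([0.80, 0.85, 0.90, 0.82, 0.88],
                                 [0.30, 0.35, 0.25, 0.40, 0.33])
p_similar = permutation_pvalue([0.50, 0.60, 0.55, 0.52],
                               [0.51, 0.59, 0.56, 0.53])
```

No distribution is assumed for the data, but the smallest achievable p value is limited by the number of distinct label assignments, so small groups give coarse p values.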

Usage and user interfaces 

All of these packages take as input mapped reads (.bam 
files) and an annotation (.gtf) file. Only AltAnalyze offers
a graphical user interface. The rest are run from the
command line. MATS and Cuffdiff 2 offer the most
streamlined use. Each has a wrapper script that is called
with a single command, as well as excellent documentation
at their respective websites. The documentation includes detailed
instructions for use at the command line, clear explanations
of the optional parameters, and step-by-step instructions to
ensure that input data are appropriately formatted. The outputs
of these two packages are user-friendly; the .txt tables
are well labeled and intuitive. Cuffdiff 2 also offers
CummeRbund [25], an R package for downstream analyses 
and visualizations. 

DiffSplice offers a wrapper script that will run the three 
steps of the analysis, or each step can be run individually.
The descriptions of usage are clear, but the discussions of
optional parameters and data interpretation are still under
construction. The visualization is a major asset of
this software package: tracks of read densities and
the deduced alternative splices can be uploaded and 
viewed on the UCSC genome browser. 

DEXSeq, DSGseq, and SplicingCompass run in R, so 
they are relatively slow. Each requires multiple steps: 
reformatting the input annotation and .bam files, building 
exon count files, and running the differential analysis. For
DSGseq, the documentation and usage at the website are
straightforward; it offers no options and outputs a
single data Table R file. DEXSeq offers multiple options
for customization, but usage must be gleaned from a combination
of the documentation available at Bioconductor,
a tutorial, and an example. The key output data table
is buried amongst all the metadata and supporting
calculations. DEXSeq provides excellent visualizations;
it builds a directory of .html files, one per gene, that
illustrate a reconstruction of alternative splicing across samples,
with supporting numbers. The SplicingCompass
tutorial gives straightforward instructions for the 44 
commands necessary to prepare and run a six-sample 
analysis. It also includes a minimal visualization tool 
that will plot the read coverage of all samples and groups 
across a gene. 

MISO offers remarkable versatility and remains user-friendly
in spite of the resulting complexity. This is largely
due to the excellent documentation at its website. Its use
requires three sequential steps: analyzing 'events' in each
sample, computing PSI for each sample, and computing 
differences between samples. There are multiple outputs, 
depending on the analysis options selected, which include 
a rich variety of details ranging from splicing event 
type and coverage to PSI and magnitude of differences. It 
includes two very useful utilities for downstream analysis. 
Outputs can be filtered based on their coverage and/or 
magnitude of change. There is also an excellent visualization 
utility that shows the modeling and statistical results 
alongside the raw RNA-Seq data as it maps across 
exons and junctions. This combination of versatility, 
ease of use, detailed outputs, and downstream utilities 
makes MISO a very attractive package. 

Limiting factors — sensitivity, accuracy, and 
validation 

All of these approaches are limited by sensitivity, a
weakness that is common to many or all RNA-Seq-based
analyses [40,41]. At commonly used data depths (e.g., 50-
to 100-M reads, 50-100 nt), RNA-Seq analyses perform
well on highly expressed and thoroughly annotated genes,
but less well on genes at moderate or low expression levels
and/or with incomplete annotations [41]. The problem
with sensitivity can be partially overcome by increasing
read depth or read length [40], especially when excellent
annotations are available to help distinguish 'signal' from
'noise' [41,45]. One alternative might be to develop a
secondary analysis based on pooling data from samples or
conditions that show high correlation in the primary
analysis, thereby increasing effective read depth and
sensitivity. Another possibility might be to make more
effective use of the information in paired ends to
infer exon usage between the paired mapped reads. 

Exon junction microarrays (Affymetrix HJAY or MJAY 
arrays, Santa Clara, CA, USA [46]) offer an oft-overlooked 
approach with higher sensitivity than RNA-Seq for 
well-annotated alternative splices. A software package
that integrated RNA-Seq and exon junction array data
would be invaluable where the experimental objective is a
comprehensive description of splicing and splicing differences.
AltAnalyze [39], which was designed to process
splice-sensitive array data, is a unified and flexible package 
that can process raw microarray data (or RNA-Seq data in 
.bed format), compute differences in alternative splicing 
(exon and junction usage), and integrate with a multitude 
of other data types (e.g., protein domains and microRNA 
binding annotations). It offers an interactive graphical 
user interface that allows users with little knowledge 
of bioinformatics programs or scripting to customize 
analyses for their data and to visualize outputs in 
Cytoscape or other useful formats. While AltAnalyze 
can be applied to RNA-Seq data, it only analyzes 
known alternative splicing, which limits its attractiveness 
in RNA-Seq applications. 

For all of these analysis tools and packages, the goal is
accuracy, that is, limiting false positives while maximizing
true positives. Accuracy is best determined by extensive
experimental validation, for instance, by using quantitative 
reverse transcribed-polymerase chain reaction (qRT-PCR), 
to measure the ratios of alternative splice products in 
the same RNA samples that were used for RNA-Seq 
(or microarray probing). High-throughput qRT-PCR 
technologies allow validation at genomic scale (e.g., [5]). 
The MATS authors used qRT-PCR to validate 164 detected alternative
splices that covered the range of false discovery rates [33].
They found that their calculated false discovery rate (fdr)
predicted validation rates; for example, splices with 
fdr < 0.2 had >80% validation rate. SplicingCompass 
showed promise for accuracy in extensive in silico analyses 
[37], but it remains to be experimentally tested. The global 
accuracy of DEXSeq, DSGseq, DiffSplice, and Cuffdiff 2 
predictions has yet to be experimentally tested. Given 
the complementary nature of the various approaches 
described above, it is recommended to try more than one 
software tool on a dataset and then compare and/or 
amalgamate the outputs. 
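
One simple way to compare and amalgamate outputs is to count how many tools report each gene. The Python sketch below assumes that each tool's results have already been reduced to a list of gene identifiers that pass that tool's own significance threshold; the function name, the tool labels, and the gene names are hypothetical.

```python
from collections import Counter


def consensus_genes(calls_by_tool, min_tools=2):
    """Genes reported as differentially spliced by at least min_tools tools."""
    votes = Counter()
    for genes in calls_by_tool.values():
        votes.update(set(genes))  # count each tool at most once per gene
    return sorted(g for g, n in votes.items() if n >= min_tools)


# Hypothetical gene lists from three different packages.
calls = {
    "tool_A": ["GENE1", "GENE2", "GENE3"],
    "tool_B": ["GENE2", "GENE3", "GENE4"],
    "tool_C": ["GENE3", "GENE5"],
}
shared = consensus_genes(calls, min_tools=2)
union = consensus_genes(calls, min_tools=1)
```

Setting `min_tools` to 1 gives the union of all calls (favoring sensitivity), while higher values give increasingly strict intersections (favoring precision). Because the approaches are complementary, genes found by only one tool are not necessarily false positives.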

This survey has been cursory, and other good splicing
analysis software packages are available [23]. For instance, GLiMMPS
is a newly developed statistical method for detecting
splicing quantitative trait loci (sQTLs) from RNA-Seq 
data [47]. Two key goals downstream of alternative 
splicing analysis are to identify the regulatory differences 
in the transcript isoforms and the functional differences in 
the encoded protein isoforms. In this respect, ASPicDB 
(http://www.caspur.it/ASPicDB/) is a database of human 
protein variants generated by AS [48]. AltAnalyze has 
begun to address the consequences of AS by incorporating 
protein domain annotations into its integrated analysis. A 
useful development would be standardized format(s) 
for outputs of differential splicing analyses, like the .bam 
or .bed formats for genomic information. This would 
facilitate development of downstream modules that 
could be applied to any differential splicing analysis 
for further exploration of functional consequences of 
variations in splicing. 

Abbreviations 

AS: Alternative splicing; ASMs: Alternative splicing modules; fdr: False 
discovery rate; PSI: Percent spliced in; qRT-PCR: Quantitative reverse 
transcribed-polymerase chain reaction; UTRs: Untranslated regions. 

Competing interests 

The author declares that she has no competing interests.

Authors' information

JH is trained in and built a career as an experimentalist in developmental 
biology, molecular genetics, and signal transduction. She has recently begun 
to employ genomic and bioinformatic approaches to developmental 
biology. She approaches the topic of software as an experienced biologist 
and a bioinformatics novice. 

Acknowledgements 

I thank Trevor Williams, Ken Jones, Larry Hunter, and the UCD Computational 
Biosciences Program for intellectual support and advice. I also thank Trevor 
Williams and Anis Karimpour-Fard for critical comments on this manuscript. 
JH was supported by NIH via a re-entry supplement DE012728-14S1.

Received: 2 September 2013 Accepted: 26 December 2013 
Published: 21 January 2014 

References 

1. Orengo JP, Cooper TA: Alternative splicing in disease. Adv Exp Med Biol 
2007, 623:212-223. 

2. Venables JP, Klinck R, Koh C, Gervais-Bird J, Bramard A, Inkel L, Durand M, 
Couture S, Froehlich U, Lapointe E, Lucier J-F, Thibault P, Rancourt C, 
Tremblay K, Prinos P, Chabot B, Elela SA: Cancer-associated regulation of 
alternative splicing. Nat Struct Mol Biol 2009, 16:670-676. 

3. David CJ, Manley JL: Alternative pre-mRNA splicing regulation in cancer: 
pathways and programs unhinged. Genes Dev 2010, 24:2343-2364. 

4. Kelemen O, Convertini P, Zhang Z, Wen Y, Shen M, Falaleeva M, Stamm S: 
Function of alternative splicing. Gene 2013, 514:1-30. 

5. Ohta S, Nishida E, Yamanaka S, Yamamoto T: Global splicing pattern 
reversion during somatic cell reprogramming. Cell Rep 2013, 5:357-366. 

6. Pan Q, Shai O, Misquitta C, Zhang W, Saltzman AL, Mohammad N, Babak T,
Siu H, Hughes TR, Morris QD, Frey BJ, Blencowe BJ: Revealing global 
regulatory features of mammalian alternative splicing using a 
quantitative microarray platform. Mol Cell 2004, 16:929-941. 

7. Dittmar KA, Jiang P, Park JW, Amirikian K, Wan J, Shen S, Xing Y, Carstens RP: 
Genome-wide determination of a broad ESRP-regulated posttranscriptional 
network by high-throughput sequencing. Mol Cell Biol 2012, 32:1468-1482.

8. Warzecha CC, Jiang P, Amirikian K, Dittmar KA, Lu H, Shen S, Guo W, Xing Y, 
Carstens RP: An ESRP-regulated splicing programme is abrogated during 
the epithelial-mesenchymal transition. EMBO J 2010, 29:3286-3300. 



9. Marquez Y, Brown JWS, Simpson C, Barta A, Kalyna M: Transcriptome 
survey reveals increased complexity of the alternative splicing landscape 
in Arabidopsis. Genome Res 2012, 22:1184-1195.

10. Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ: Deep surveying of alternative 
splicing complexity in the human transcriptome by high-throughput 
sequencing. Nat Genet 2008, 40:1413-1415. 

11. Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C, Kingsmore SF, 
Schroth GP, Burge CB: Alternative isoform regulation in human tissue 
transcriptomes. Nature 2008, 456:470-476. 

12. Chow LT, Gelinas RE, Broker TR, Roberts RJ: An amazing sequence 
arrangement at the 5' ends of adenovirus 2 messenger RNA. Cell 1977, 
12:1-8. 

13. Berget SM, Moore C, Sharp PA: Spliced segments at the 5' terminus of 
adenovirus 2 late mRNA. Proc Natl Acad Sci USA 1977, 74:3171-3175. 

14. Blencowe BJ: Alternative splicing: new insights from global analyses. 
Cell 2006, 126:37-47.

15. Nilsen TW, Graveley BR: Expansion of the eukaryotic proteome by 
alternative splicing. Nature 2010, 463:457-463. 

16. Chen M, Manley JL: Mechanisms of alternative splicing regulation: 
insights from molecular and genomics approaches. Nat Rev Mol Cell Biol
2009, 10:741-754.

17. Kornblihtt AR, Schor IE, Alio M, Dujardin G, Petrillo E, Munoz MJ: Alternative 
splicing: a pivotal step between eukaryotic transcription and translation. 
Nat Rev Mol Cell Biol 2013, 14:153-165.

18. Gruber AR, Martin G, Keller W, Zavolan M: Means to an end: mechanisms 
of alternative polyadenylation of messenger RNA precursors.
Wiley Interdiscip Rev RNA 2013. doi:10.1002/wrna.1206.

19. Gabut M, Samavarchi-Tehrani P, Wang X, Slobodeniuc V, O'Hanlon D, Sung H-K,
Alvarez M, Talukder S, Pan Q, Mazzoni EO, Nedelec S, Wichterle H, Woltjen K, 
Hughes TR, Zandstra PW, Nagy A, Wrana JL, Blencowe BJ: An alternative 
splicing switch regulates embryonic stem cell pluripotency and 
reprogramming. Cell 2011, 147:132-146.

20. Ellis JD, Barrios-Rodiles M, Colak R, Irimia M, Kim T, Calarco JA, Wang X, 
Pan Q, O'Hanlon D, Kim PM, Wrana JL, Blencowe BJ: Tissue-specific 
alternative splicing remodels protein-protein interaction networks. 
Mol Cell 2012, 46:884-892.

21. Buljan M, Chalancon G, Eustermann S, Wagner GP, Fuxreiter M, Bateman A, 
Babu MM: Tissue-specific splicing of disordered segments that embed 
binding motifs rewires protein interaction networks. Mol Cell 2012, 
46:871-883. 

22. EURASNET (European alternative splicing network): Alternative Splicing
Databases. [http://www.eurasnet.info/tools/asdatabases]

23. Feng H, Qin Z, Zhang X: Opportunities and methods for studying alternative 
splicing in cancer with RNA-Seq. Cancer Lett 2012, 340(2):179-91.

24. Cufflinks — transcript assembly, differential expression, and differential 
regulation for RNA-Seq. [http://cufflinks.cbcb.umd.edu/] 

25. Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L: 
Differential analysis of gene regulation at transcript resolution with 
RNA-seq. Nat Biotechnol 2012, 31:46-53. 

26. MISO: probabilistic analysis of RNA sequencing data for identifying 
isoform regulation. [http://genes.mit.edu/burgelab/miso/docs/] 

27. Katz Y, Wang ET, Airoldi EM, Burge CB: Analysis and design of RNA 
sequencing experiments for identifying isoform regulation. Nat Methods
2010, 7:1009-1015.

28. DEXSeq: inference of differential exon usage in RNA-Seq.
[http://www.bioconductor.org/packages/release/bioc/html/DEXSeq.html]

29. Anders S, Reyes A, Huber W: Detecting differential usage of exons from 
RNA-seq data. Genome Res 2012, 22:2008-2017. 

30. DSGseq: a useful tool for identifying differentially spliced genes from 
two groups of RNA-seq samples.
[http://bioinfo.au.tsinghua.edu.cn/software/DSGseq]

31. Wang W, Qin Z, Feng Z, Wang X, Zhang X: Identifying differentially spliced 
genes from two groups of RNA-seq samples. Gene 2012, 518(1):164-70.

32. Multivariate analysis of transcript splicing (MATS).
[http://rnaseq-mats.sourceforge.net/]

33. Shen S, Park JW, Huang J, Dittmar KA, Lu Z-X, Zhou Q, Carstens RP, Xing Y: 
MATS: a Bayesian framework for flexible detection of differential alternative 
splicing from RNA-Seq data. Nucleic Acids Res 2012, 40:e61. 

34. Hu Y, Huang Y, Du Y, Orellana CF, Singh D, Johnson AR, Monroy A, Kuan P-F,
Hammond SM, Makowski L, Randell SH, Chiang DY, Hayes DN, Jones C,
Liu Y, Prins JF, Liu J: DiffSplice: the genome-wide detection of
differential splicing events with RNA-seq.
[http://www.netlab.uky.edu/p/bioinfo/DiffSplice]

35. Hu Y, Huang Y, Du Y, Orellana CF, Singh D, Johnson AR, Monroy A, Kuan P-F,
Hammond SM, Makowski L, Randell SH, Chiang DY, Hayes DN, Jones C, Liu Y,
Prins JF, Liu J: DiffSplice: the genome-wide detection of differential splicing
events with RNA-seq. Nucleic Acids Res 2013, 41:e39-e39.

36. SplicingCompass: differential splicing detection using RNA-Seq data.
[http://www.ichip.de/software/SplicingCompass.html]

37. Aschoff M, Hotz-Wagenblatt A, Glatting K-H, Fischer M, Eils R, Konig R:
SplicingCompass: differential splicing detection using RNA-Seq data.
Bioinformatics 2013, 29:1141-1148.

38. Salomonis N, Emig D, Albrecht M, Hanspers K, Pico A, Conklin BR:
AltAnalyze: alternative splicing analysis tool. [http://www.altanalyze.org/]

39. Emig D, Salomonis N, Baumbach J, Lengauer T, Conklin BR, Albrecht M:
AltAnalyze and DomainGraph: analyzing and visualizing exon expression
data. Nucleic Acids Res 2010, 38(Web Server issue):W755-W762.

40. Liu Y, Ferguson JF, Xue C, Silverman IM, Gregory B, Reilly MP, Li M:
Evaluating the impact of sequencing depth on transcriptome profiling in
human adipose. PLoS ONE 2013, 8:e66883.

41. Steijger T, Abril JF, Engstrom PG, Kokocinski F, Consortium RGASP, Abril JF,
Akerman M, Alioto T, Ambrosini G, Antonarakis SE, Behr J, Bertone P,
Bohnert R, Bucher P, Cloonan N, Derrien T, Djebali S, Du J, Dudoit S,
Engstrom PG, Gerstein M, Gingeras TR, Gonzalez D, Grimmond SM, Guigo R,
Habegger L, Harrow J, Hubbard TJ, Iseli C, Jean G, et al.: Assessment of
transcript reconstruction methods for RNA-seq. Nat Methods 2013,
10:1177-1184.

42. Pruitt KD, Tatusova T, Brown GR, Maglott DR: NCBI Reference Sequences
(RefSeq): current status, new features and genome annotation policy.
Nucleic Acids Res 2012, 40(Database issue):D130-5.

43. Flicek P, Ahmed I, Amode MR, Barrell D, Beal K, Brent S, Carvalho-Silva D,
Clapham P, Coates G, Fairley S, Fitzgerald S, Gil L, Garcia-Giron C, Gordon L,
Hourlier T, Hunt S, Juettemann T, Kahari AK, Keenan S, Komorowska M,
Kulesha E, Longden I, Maurel T, McLaren WM, Muffato M, Nag R, Overduin B,
Pignatelli M, Pritchard B, Pritchard E, et al.: Ensembl 2013. Nucleic Acids Res
2013, 41(Database issue):D48-55.

44. Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F,
Aken BL, Barrell D, Zadissa A, Searle S, Barnes I, Bignell A, Boychenko V,
Hunt T, Kay M, Mukherjee G, Rajan J, Despacio-Reyes G, Saunders G, Steward C,
Harte R, Lin M, Howald C, Tanzer A, Derrien T, Chrast J, Walters N,
Balasubramanian S, Pei B, Tress M, et al.: GENCODE: the reference human
genome annotation for The ENCODE Project. Genome Res 2012,
22:1760-1774.

45. Korf I: Genomics: the state of the art in RNA-seq analysis. Nat Methods
2013, 10:1165-1166.

46. Licatalosi DD, Mele A, Fak JJ, Ule J, Kayikci M, Chi SW, Clark TA, Schweitzer AC,
Blume JE, Wang X, Darnell JC, Darnell RB: HITS-CLIP yields genome-wide
insights into brain alternative RNA processing. Nature 2008, 456:464-469.

47. Zhao K, Lu Z-X, Park JW, Zhou Q, Xing Y: GLiMMPS: robust statistical
model for regulatory variation of alternative splicing using RNA-Seq
data. Genome Biol 2013, 14:R74.

48. Martelli PL, D'Antonio M, Bonizzoni P, Castrignano T, D'Erchia AM,
D'Onorio De Meo P, Fariselli P, Finelli M, Licciulli F, Mangiulli M, Mignone F,
Pavesi G, Picardi E, Rizzi R, Rossi I, Valletti A, Zauli A, Zambelli F, Casadio R,
Pesole G: ASPicDB: a database of annotated transcript and protein
variants generated by alternative splicing. Nucleic Acids Res 2011,
39(Database issue):D80-D85.



doi:10.1186/1479-7364-8-3

Cite this article as: Hooper: A survey of software for genome-wide 
discovery of differential splicing in RNA-Seq data. Human Genomics 
2014 8:3. 






